The Adaptive s-step Conjugate Gradient Method
نویسنده
چکیده
On modern large-scale parallel computers, the performance of Krylov subspace iterative methods is limited by global synchronization. This has inspired the development of s-step (or communication-avoiding) Krylov subspace method variants, in which iterations are computed in blocks of s. This reformulation can reduce the number of global synchronizations per iteration by a factor of O(s), and has been shown to produce speedups in practical settings. Although the s-step variants are mathematically equivalent to their classical counterparts, they can behave quite differently in finite precision depending on the parameter s. If s is chosen too large, the s-step method can suffer a convergence delay and a decrease in attainable accuracy relative to the classical method. This makes it difficult for a potential user of such methods the s value that minimizes the time per iteration may not be the best s for minimizing the overall time-to-solution, and further may cause an unacceptable decrease in accuracy. Towards improving the reliability and usability of s-step Krylov subspace methods, in this work we derive the adaptive s-step CG method, a variable s-step CG method where in block k, the parameter sk is determined automatically such that a user-specified accuracy is attainable. The method for determining sk is based on a bound on growth of the residual gap within block k, from which we derive a constraint on the condition numbers of the computed O(sk)-dimensional Krylov subspace bases. The computations required for determining the block size sk can be performed without increasing the number of global synchronizations per block. Our numerical experiments demonstrate that the adaptive s-step CG method is able to attain up to the same accuracy as classical CG while still significantly reducing the total number of global synchronizations.
منابع مشابه
Global conjugate gradient method for solving large general Sylvester matrix equation
In this paper, an iterative method is proposed for solving large general Sylvester matrix equation $AXB+CXD = E$, where $A in R^{ntimes n}$ , $C in R^{ntimes n}$ , $B in R^{stimes s}$ and $D in R^{stimes s}$ are given matrices and $X in R^{stimes s}$ is the unknown matrix. We present a global conjugate gradient (GL-CG) algo- rithm for solving linear system of equations with multiple right-han...
متن کاملAdaptive Stochastic Conjugate Gradient Optimization for Temporal Medical Image Registration
We propose an Adaptive Stochastic Conjugate Gradient (ASCG) optimization algorithm for temporal medical image registration. This method combines the advantages of Conjugate Gradient (CG) method and Adaptive Stochastic Gradient Descent (ASGD) method. The main idea is that the search direction of ASGD is replaced by stochastic approximations of the conjugate gradient of the cost function. In addi...
متن کاملFirst International Conference and Exhibition Digital Signal Processing ( DSP ' 98 ) Subspace Adaptive Algorithm For Blind Separation Of Convolutive Mixtures By Conjugate Gradient Method
In this paper, a new subspace adaptive algorithm, for blind separation of convolutive mixture, is proposed. This algorithm can be decomposed into two steps: At rst, the convolutive mixture will be reduced to an instantaneous mixture (memoryless mixture), using a second-order statistics criterion based on subspace approach. The second step consists on the separation of the residual instantaneous...
متن کاملAn Improved Conjugate Gradient Based Learning Algorithm for Back Propagation Neural Networks
The conjugate gradient optimization algorithm is combined with the modified back propagation algorithm to yield a computationally efficient algorithm for training multilayer perceptron (MLP) networks (CGFR/AG). The computational efficiency is enhanced by adaptively modifying initial search direction as described in the following steps: (1) Modification on standard back propagation algorithm by ...
متن کاملAn eigenvalue study on the sufficient descent property of a modified Polak-Ribière-Polyak conjugate gradient method
Based on an eigenvalue analysis, a new proof for the sufficient descent property of the modified Polak-Ribière-Polyak conjugate gradient method proposed by Yu et al. is presented.
متن کاملRobust Adaptive Beamforming Based on a Gradient Projection Method
Recently, adaptive beamforming has been widely used in wireless communications, microphone array speech processing and so on. One of the adaptive beamforming methods is directionally constrained minimization of power. However, this method is known to degrade if some of underlying assumptions on the environment, sources, or sensor array become violated [5]. To resolve this disadvantage, some met...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1701.03989 شماره
صفحات -
تاریخ انتشار 2017